Visual Question Answering (VQA) is the task of answering natural-language questions about images. We introduce the novel problem of determining the relevance of questions to images in VQA. Current VQA models do not reason about whether a question is even related to the given image (e.g. What is the capital of Argentina?) or if it requires information from external resources to answer correctly. This can break the continuity of a dialogue in human-machine interaction. Our approaches for determining relevance are composed of two stages. Given an image and a question, (1) we first determine whether the question is visual or not, (2) if visual, we determine whether the question is relevant to the given image or not. Our approaches, based on LSTM-RNNs, VQA model uncertainty, and caption-question similarity, are able to outperform strong baselines on both relevance tasks. We also present human studies showing that VQA models augmented with such question relevance reasoning are perceived as more intelligent, reasonable, and human-like.
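The sketch below illustrates the two-stage decision flow described in the abstract (first "is the question visual?", then "is it relevant to this image?"). It is only a minimal stand-in: the keyword test for visualness, the stopword list, the bag-of-words caption-question cosine similarity, and the 0.2 threshold are assumptions for illustration, not the paper's LSTM-RNN, VQA-uncertainty, or caption-similarity models.

```python
# Illustrative two-stage relevance check (stand-in, not the paper's models).
import math
import re
from collections import Counter

STOPWORDS = {"what", "is", "the", "a", "an", "in", "on", "of", "to", "this"}

def content_words(text: str) -> Counter:
    """Lowercased, punctuation-stripped content words as a bag-of-words vector."""
    tokens = re.findall(r"[a-z]+", text.lower())
    return Counter(t for t in tokens if t not in STOPWORDS)

def cosine(va: Counter, vb: Counter) -> float:
    """Cosine similarity between two bag-of-words vectors."""
    dot = sum(va[w] * vb[w] for w in va)
    na = math.sqrt(sum(c * c for c in va.values()))
    nb = math.sqrt(sum(c * c for c in vb.values()))
    return dot / (na * nb) if na and nb else 0.0

def is_visual(question: str) -> bool:
    """Stage 1 (stand-in): questions mentioning visual cue words are treated as visual."""
    visual_cues = {"color", "wearing", "picture", "image", "photo", "holding", "shown"}
    return bool(visual_cues & set(re.findall(r"[a-z]+", question.lower())))

def relevance_decision(question: str, image_caption: str, threshold: float = 0.2) -> str:
    """Stage 2 (stand-in): caption-question similarity decides relevance to the image."""
    if not is_visual(question):
        return "non-visual"
    sim = cosine(content_words(question), content_words(image_caption))
    return "relevant" if sim >= threshold else "irrelevant to this image"

caption = "A man riding a horse on a beach"
print(relevance_decision("What is the capital of Argentina?", caption))        # non-visual
print(relevance_decision("What color is the horse in the picture?", caption))  # relevant
print(relevance_decision("What color is the dog in the picture?", caption))    # irrelevant to this image
```

The caption-based second stage mirrors the abstract's caption-question similarity idea in the simplest possible form; in practice the paper's learned models would replace both hand-written stages.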